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Abstract 

The type IV pilus retraction motor is found in many important bacterial pathogens. 
It is the strongest known linear motor protein and is required for bacterial in- 
fectivity. We characterize the dynamics of type IV pilus retraction in terms 
of a stochastic chemical reaction model. We find that a two state model can 
describe the experimental force velocity relation and qualitative dependence of 
ATP concentration. The results indicate that the dynamics is limited by an 
ATP-dependent step at low load and a force-dependent step at high load, and 
that at least one step is effectively irreversible in the measured range of forces. 
The irreversible nature of the sub-step(s) lead to interesting predictions for 
future experiments: We find different parameterizations with mathematically 
identical force velocity relations but different fluctuations (diffusion constant). 
We also find a longer elementary step compared to an earlier analysis, which 
agrees better with known facts about the structure of the pilus filament and 
energetic considerations. We conclude that more experimental data is needed, 
and that further retraction experiments are likely to resolve interesting details 
and give valuable insights into the PilT machinery. In light of our findings, the 
fluctuations of the retraction dynamics emerge as a key property to be studied 
in future experiments. 



Introduction 

The type IV pilus (TFP) retraction motor is interesting and important for 
several reasons. Not only is it necessary for the infectivity of many human 
pathogens Q), it is also the strongest known linear motor protein, making it an 
interesting test ground for understanding generation of large forces in nanosys- 
tems. The physics of pilus retraction has recently been studied experimentally 
(JiJSQ), an d it is of interest for theoretical study, which we present here. 

Physical modeling of motor proteins has seen a rapid development recently, 
made possible by improved experimental techniques that enable measurement 
on the single molecule level. It is possible to apply general thermodynamic 
considerations to construct models which do not require fully detailed knowledge 
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of the molecular details of the system. This can lead to new insights about the 
molecular mechanism, make predictions that can be tested experimentally, and 
suggest new experiments. 

We will now briefly review the relevant facts known about the PilT system. 
After that, we introduce the model with emphasis on the underlying assumptions 
and the interpretation of the model parameters. TFP are surface filaments 
crucial for initial adherence of certain gram-negative bacteria to target host cells, 
DNA uptake, cell signaling, and bacterial motility. The pili filaments consist 
of thousands of pilin subunits that polymerize to helical filaments with outer 
diameter of about 6 nm, a 4 nm pitch and 5 subunits per turn Bacterial 
motility (twitching motility) is propelled by repeated extension and retraction of 
TFP, by which the bacterium can pull itself forward on surfaces like glass plates 
or target host cells (Q)- During retraction, the filament is depolymerized, and 
the pilin subunits are stored in the cell membrane (0) . TFP are expressed by a 
wide range ofgram-negative bacteria Jlj) including pathogens such as Neisseria 
gonorrhoeae {3) , Myxococcus xanthus (0) and Pseudomonas aeruginosa . The 
retraction process is believed to be mediated by a protein called PilT, which 
is a hexameric (0) motor protein in the AAA family of ATPases (0). Pilus 
retraction in N. gonorrhoeae generates forces well above 100 pN (0; 0), making 
PilT the strongest known linear motor protein. The large force combined with 
the hexameric structure of PilT indicates that retraction of a single pilin subunit 
may involve hydrolysis of several ATP molecules. 

The physics of pilus retraction has so far been studied experimentally in 
some detail (0; S The experimental data on retraction velocity has pre- 
viously been analyzed theoretically using a model with one chemical reaction 
step with an Arrhenius type force dependence, which describes the retraction 
at high loads (0). In this paper, we will extend the analysis to two different 
reaction steps, which is sufficient describe the existing data for loads. We will 
also make predictions about fluctuations in the retraction process, which are 
experimentally accessible. This turns out to be crucial as we consider models 
with constrained load patterns. We find two families of parameterizations that 
describe the data fairly well, give exactly the same velocity, but predict qualita- 
tively different fluctuations. The constrained models also give a partial explana- 
tion for the surprisingly short elementary step in the one state model: Adding 
an extra state to account for behavior a low loads changes the interpretation 
of the data for high loads, resulting in a longer elementary step. Nevertheless, 
one would like to account for an even longer elementary step, to make the de- 
scription compatible with the known structure of the pilus filament. This work 
represents an important improvement in this respect. It also indicates that the 
pilus retraction system hides more interesting dynamics, and that fluctuations 
are the key property to gain further insights about this remarkable motor. 

Mechano chemical step model 

Discrete mechanochemical models are well established in the description of mo- 
tor proteins, and have been used successfull y to describe the motion of proteins 
that walk along structural filaments llu lllj LLSJ; LLiS lidf lla) • The main idea 
is to describe the motion in terms of stochastic transitions between meta-stable 
states in the reaction cycle of the protein. The starting point is random walk 
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Figure 1: N = 2 mechanochemical step model. 



between nearest neighbor states on a one-dimensional lattice. Each lattice point 
corresponds to a meta-stable state in the reaction that drives the motion. 

We consider an unbranched reaction with a period of N steps, in which each 
state is connected to two nearest neighbors. We denote state j in period k by 
jk, which corresponds to a position x{jk) — x(jo) + kd along the track, where 
d is the spatial period of the reaction (the elementary step). The reaction is a 
Markov process with non-negative forward and backward transition rates Uj and 
Wj respectively, as illustrated in figure ^ for the case N = 2. An exact solution 
due to Derrida gives analytic expressions for the steady state velocity v 
and diffusion constant D for arbitrary transition rates and period TV <|12t Il7|) . 

A model of this form can, at least in principle, be derived from a theory of 
the microscopic degrees of freedom <[18i llflTI , if we assume that the motor action 
consists of fast reactions between meta-stable sub-states, which are wells in a 
free energy landscape. Even so, the model does not have to include all actual 
reactions; quickly decaying states can be lumped together with slower, rate- 
limiting ones (jljj). An alternative to the above mechanism of a fast "working 
step" that produces work is the "power stroke" , in which a fast reaction loads 
potential energy to an internal degree of freedom, which in turn does work 
through relaxation (|20T) . Work following this approach is under way. 

Following Fisher and Kolomeisky ifla Hit) , we take a mechanochemical step 
model as a phenomenological starting point, and assume that the chemical re- 
actions obey an Arrhenius law <|lfl). The non- negative transition rates depend 
on an opposing load F, parallel to the track, and take the form iflfl) 

Uj = u 3 e- Fc i/ k * T , Wj = W] e+ Fd il k * T (1) 

where k^T is Boltzmann's constant times temperature, Ui, Wj are force inde- 
pendent rate constants, and Cj,dj are forward and backward load distribution 
lengths. Cj, dj can be interpreted as positions of the activation barriers projected 

along the track. The load lengths are expected to obey Y^jLii^j + c j) = d, where 
d is the spatial period of the motion (the elementary step), which we will dis- 
cuss below. One can identify sub-steps of size (dj + Cj) between jk and {j + l)k 

The simple form of the reaction rates and the correspondence between load 
lengths and actual distances is not universally valid. It rests on assumptions 
that the free energy wells corresponding to the sub-states are narrow and sim- 
ilar in shape, and that the force is not too large to make thermal fluctuations 
unimportant in the reaction process jlSft . 

In the following, we will consider N = 2. Thete is not enough experimental 
data to fit parameters for higher order models, and an N = 1 model can only 
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describe the behavior in a limited range of forces (0). Retraction experiments 
suggest a process with an ATP-limited step at low loads and a force-limited step 
at high loads (@), which suggests that a two-state model is necessary to describe 
the full dynamics. The steady state velocity given by iflrl IrH Il2h 

v = li m (- T ( f )) - (uqUi - wpwi)d ^ 

t^oo t Uq + Ul + Wl + Wo 

and the diffusion constant is 

d 2 d 2 

D = — (uqUx + WqWx) -(u Ul - WqW!) 2 (3) 

la a- 3 

The diffusion constant D is defined as [(x 2 (t)) — (x(t)) 2 ] /2t in the limit t — > oo. 
As is evident from the above equations, v and _D do not depend on which 
backward rate is associated with which forward rate. Therefore it is not possible 
to uniquely identify sub-steps from knowledge of v and D in a two state model. 



Modeling retraction data 

We fitted Eqs. (1112(1 to retraction data for wild type N. gonorrhoeae at 330 K 
from Ref. with a maximum likelihood (ML) estimate. Assuming Gaussian 
errors, this amounts to minimizing — 21nL = J2 n (v(F n ) — v n ) 2 /a 2 <|2l|) . where 
F n , v n and er„ are experimental forces, velocities and error bars respectively, 
and L is the (Gaussian) likelihood function. The period d is not an independent 
parameter in the velocity, and was included in the transition rates. As seen in 
Fig. [3 the data is well described. The best fit parameters of the unrestricted 
N = 2 model, which we denote N2, are shown in Tab.Q] along with parameter 
sets which are optimal when the parameter space is restricted in different ways. 
We denote the restricted parameterizations A and B, and also include Nl, the 
high force analysis of Ref. (0), which we reproduce for reference. Some parame- 
ters in N2 are extremely small, and can be put to zero without any significant 
difference in velocity, which is how the A parameterizations is constructed. This 
does not imply that one reaction is completely irreversible, which would violate 
detailed balance iflflh . but rather that one backward rate is small enough to be 
negligible in the fit. 1 

There is no obvious relation between the load distribution lengths of A and 
Nl, which is surprising since Nl is a good description of the high force regime, 
ft is therefore tempting to ask whether the force velocity relation can be equally 
well described by other parameterizations, with at least one irreversible step 
and only one force-dependent step. The minimal way to do this is to remove 
the backward step completely from A. This constrained parameterization, B, 
cannot be statistically rejected even at 80% confidence. However, the rate- 
limiting step in B is still twice as large in Nl, a result which is retained even 
if only high forces are considered (not shown). The reason for this difference is 
the presence of a force independent step. To show this, we rewrite the N = 1 
velocity in the form of an N = 2 model with two equal consecutive reactions. 

Interestingly, a discrete N = 2 model with one irreversible step can be derived as a limiting 
case of certain class of simple ratchet models ilTtl . 
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For simplicity, we neglect the weak force dependence of the backward rate. With 
the notation p( x ' for parameter p in parameterization X , we get: 



2d< N1 ) ((4 N1) ) 2 e- 2 ^ (N1, ABT _ (w ( N1 ))2) 
/N1) = 2(4 N1) e-™r ) /^T + w (ND ) 



(4) 



At forces where the numerator is dominating the force dependence, this is similar 
to an N = 2 model with one force independent step and twice as long load dis- 
tribution length on the other step. The factor 2 in numerator and denominator 
shows the fact that the elementary step is also twice as long, if the denominator 
is brought to the same form as Eq. ©, with four terms. We conclude that the 
plateau in v(F) at low loads have implications for the interpretation of data at 
high loads, and that the earlier estimate of the elementary step in Ref. fljj) is 
too small by (at least) a factor 2. 

The B parameterization has another interesting property. It has exactly the 
same force velocity relation as two different parameter families, C and D, which 
we now describe. To construct C from B, one adds an arbitrary force indepen- 
dent backward reaction. To keep the velocity unchanged, the force dependent 
forward rate must be modified, but all other parameters in B are kept. 

wP =Rc,dP = 0,«<°> =4 B) G^) (5) 

Here, the constant Rc is non-negative but otherwise arbitrary, and we retrieve 
the B model in the limit Rc — > 0. To construct the D parameter family from 
B, we move the load dependence from the forward step to a new (arbitrary) 
backward rate, and modify both forward rates. 

w™ = r d , r d > (24 B) ) 2 M B ) , 4 D) = 4 B) 

(D) _ u^Rp ( , f. (2n< B >)2 \ r (D)„ (6) 

Rd again defines a parameter family, and the choice of sign in the lower equation 
is irrelevant for v and D. 

Using the obtained parameters, we can compute the diffusion constant D. 
In the context of molecular motors, diffusion is commonly analyzed in terms 
of the dimensionless randomness parameter r = 2D/vd jl5 IT3: Hit fliit E^t 
Eih . which is shown in Fig. Et- The randomness is convenient in this case, 
as it is independent of d, which is unknown for the PUT motor. Although 
parameterizations B, C and D give equivalent force- velocity relations, they 
make different predictions for the diffusion constant. We see that B and C make 
qualitatively different predictions than D, and that A is quite close to B at high 
forces, but deviating at F < 60 pN. We expect a measurement of the diffusion 
constant D would discriminate between the different parameterizations. N2 is 
again indistinguishable from A (not shown). 

The velocity dependence on ATP concentration ([ATP]) has been studied 
in experiments on ATP depleted bacteria, and two regimes where found 
At low loads, the velocity was strongly dependent on [ATP], but at high loads, 
the velocity was the same for the depleted strain as for the undepleted strain 
( "wild type" ) . A simple way to include ATP dependence in a mechanochemical 
step model is to make one forward rate ('binding reaction') proportional to 
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Table 1: Maximum likelihood estimates of parameters. The products dui, dwj 
are given in units of nm/s, and load lengths are in nm. d is the (unknown) 
lattice period. Lx is the maximum value of the likelihood function and kx 
the number of free parameters in parameterization X . If X is constructed from 
parameterization Y by constraining some parameters in Y , X is rejected in favor 
of Y if 21n(Xy/Lx) is larger than the upper a-percentile of a x 2 distribution 
with (ky — kx) degrees of freedom, where 1 — a is the chosen level of confidence 
fl22T). A cannot be rejected in favor of N2; there is practically no difference. 
Even at only 80% confidence, B cannot be rejected in favor of N2 or A. An 
unrestricted fit of the behavior at high load to an N = 1 model gives the Nl 
parameters, reproducing the analysis of Ref. (|7|). 



[ATP] . Figure Eb shows the velocity relation for the A parameters with the 
load independent forward rate reduced by 50%. We see that the difference 
compared to the original A parameters vanishes for high load, in qualitative 
agreement with the experimental result (@). Adding the [ATP] dependence to 
the other step gives qualitatively different dependence (not shown). Also note 
that if [ATP] dependence is added to the restricted parameterizations B-D in 
this way, they still predict the same velocity for all loads and [ATP]. 

Conclusion 

We interpret pilus retraction data on wild type TV. gonorrhoeae in terms of a 
mechanochemical model, which is a discrete random walk with steps between 
nearest neighbors. Despite its simplicity, a description in these terms contains 
interestin g in formation about the free energy landscape of the retraction reac- 
tions (dill El). 

We find that the experimental data for retraction velocity is well described 
for all measured forces by several parameterizations, all of which have one effec- 
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tively force independent step and one irreversible step. The model also captures 
the qualitative behavior of varying [ATP], namely that the [ATP] dependence 
of velocity is strong at low force and very weak at high force (0). As expected, 
the binding reaction is not force dependent, but the unbinding might be. 

We find several different parameterizations that give similar or identical 
velocities, but make very different predictions for the diffusion constant D of 
the retraction. Without any assumptions regarding the elementary step d, we 
predict the randomness r = 2D/dv as shown in Fig. [2t, which would be of 
considerable interest to study experimentally. Such measurements may be able 
to distinguish the different parameterizations (A, B, C and D) and lead to 
additional important insight into the pilus retraction mechanism. 

We use a model that can account for the all measured forces, we make pre- 
dictions open to experimental test, and we get new result for the elementary 
step d: 0.5 nm for parameterization A and 0.2 nm for the B, C and D param- 
eters, compared to 0.1 nm for the N = 1 model 0). As we argued above, the 
short step may be an artifact of using a model with too few states to describe a 
restricted part of the force-velocity curve. Adding another state to account for 
the behavior at low loads changes the interpretation of the high load behavior. 

Even if we get at least twice as long step as in Ref . (0) , it is still not obvious 
how our result fits with the known facts about the system. Each pilin subunit 
contributes about 0.8 nm to the filament length 0; 0; Q; 0), hence one would 
like to account for at least one pilin subunit in a complete description of a 
reaction cycle. In this light, our work is a great improvement even if there 
are still a few Angstrom left to account for. One possibility is that a model 
with more states could account for the missing length, just like going from 
one state to two states did. Another is that the strong forces deform the sub- 
states and destroy the correspondence between load lengths and physical lengths 
iTtsl) . Since the existing data is well described by our model, this question must 
probably be settled experimentally. There is also a length associated with the 
involved energies and forces: at the maximal measured force, 160 pN, the free 
energy gain from hydrolysis of one ATP molecule under physiological conditions 
(~ 80 pN nm Ija llCl) ') is enough to retract 0.5 nm. The evidence that retraction 
is powered by ATP hydrolysis (0) does not imply that it is powered only by 
ATP hydrolysis. The above estimate indicates that more energy than that of 
one ATP is needed to retract one subunit. Since PilT forms a hexamer, it 
may hydrolyze up to six ATP in parallel, which would explain the high force 
energetically. Another possibility is that free energy from depolymerization of 
the filament is used in retraction. It might be possible to resolve details of 
for example a second binding event with a model with more states. However, 
models with more states have more parameters, so more experimental data is 
needed to explore these exciting possibilities. 

This work extends previous physical modeling of the PilT system 0) in 
several ways. In particular, the fluctuations (randomness parameter) emerge as 
a key property for further theoretical and experimental study of the dynamics 
of pilus retraction. 
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Figure 2: Force velocity relations for pilus retraction, parameterizations as de- 
fined in tabled (a) Experimental retraction data for "wild type" N. gonorrhoeae 
(0), full N = 2 parameterizations A (same as N2), and restricted N = 2 pa- 
rameterizations B, C and D. The experiment is well described by the N = 2 
model, and the result for A and N2 is almost indistinguishable from B, C and 
D. (b) High load analysis as in Ref. (7) with an N = 1 model (Nl) compared 
to the A parameterization. Also a model for an ATP-depleted version of A 
by setting uo — > 0.5uo- The ATP-depleted velocity tends to the non-depleted 
result (A) at high loads, in qualitative agreement with experimental findings 
(0). Also, the N = 1 and N = 2 models agree at high load, (c) Predicted 
randomness r — 2D/vd for the different parameterizations A (and N2), C and 
D. Parameters B is the limit Re — » of C, which is the lowest curve. The 



lowest curve of the D parameter family is the limit Rr> 



4(4 B; 



(B) 
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s . The C and D families predict qualitatively different behavior, which should 
make it possible to distinguish experimentally between them even though they 
give exactly the same velocity. Also note that 0.5 is a strict lower bound on 
randomness for models of this kind with N = 2 (|2S 
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